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ABSTRACT 

During initial transcription, RNA polymerase 
remains bound at the promoter and synthesizes 
RNA without movement along the DNA template, 
drawing downstream DNA into itself in a process 
called scrunching and thereby storing energy to 
sever the bonds that hold the enzyme at the 
promoter. We show that DNA scrunching also is 
the driving force behind the escape of RNA polymer- 
ase from a regulatory pause of the late gene operon 
of bacteriophage X, and that this process is 
enhanced by the activity of the antiterminator. 
Furthermore, we show that failure of transcription 
complexes to escape the pause results in back- 
tracking and arrest in a process analogous to 
abortive initiation. We identify a sequence element 
that modulates both abortive synthesis and the for- 
mation of arrested elongation complexes. 

INTRODUCTION 

The segment of DNA immediately downstream of a tran- 
scription start site frequently is an important site of regu- 
lation of transcription elongation, where the positive 
energies of the polymerization reaction encounter inhibi- 
tory forces that deter elongation. In one example, all 
multisubunit RNA polymerases (RNAPs) tend to abort 
early synthesis, presumably when polymerization fails to 
provide the energy to escape RNAP interactions with the 
promoter, thereby releasing abortive RNAs, typically 5- 
10 nt in length (1-4). Importantly, both eukaryotic and 
bacterial RNAPs can be restrained by specific protein 
interactions that make further elongation dependent on 
particular regulatory modifications. A prominent 
example is provided by eukaryotic RNAP II, which fre- 
quently stalls tens of nucleotides from the transcription 



start site due to the inhibitory activity of proteins like 
Dsif and Nelf, and then is rescued into elongation by 
the regulatory kinase pTefb in a reaction essential to tran- 
scription activation (5). 

The energetic transactions in early transcription elong- 
ation are particularly exposed in the bacterial regulatory 
system of the bacteriophage A gene Q antiterminator. 
becomes a subunit of RNAP, allowing it to resist termin- 
ators through both antipausing and a structural 
rearrangement that inhibits terminator RNA function 
(6,7). function requires an early transcription pause 
at +16 that is induced by a specific protein interaction, 
namely, binding of the a^^ initiation factor to a secondary 
— 10-Hke sequence in the early transcribed segment (8,9). 
Q'^ helps RNAP escape this pause (as well as critical 
pauses at terminators downstream) (8), exhibiting an 
essential antipausing activity that may exemplify the way 
regulatory factors can overcome early transcription 
barriers. 

The realization that initial transcribing complexes are 
formed through DNA 'scrunching' (10,11), namely, the 
unwinding of downstream DNA that then is incorporated 
as single-stranded DNA into the transcription complex, 
provides a structural context for understanding the 
nature of promoter-proximal pausing (e.g. a^^-induced) 
and the reversal of pausing by antipausing factors (like 
Q'^). This derives directly from the scrunching model of 
initial transcription and abortive initiation, as follows. A 
consideration of energies involved in synthesis and elong- 
ation of the initial transcribing complex suggests that 
scrunching stores energy that is used to break the a^^- 
DNA bonds that stabiHze the open promoter complex. 
Further energy is required to displace a protein linker 
between a^^ domains 3 and 4 (12,13); combined, these 
barriers oppose the elongation reaction and can lead at 
a significant rate to its failure, resulting in 'abortive initi- 
ation', the loss of the initial transcript from the complex. 
The extent and pattern of abortive transcript release 



*To whom correspondence should be addressed. Tel: +1 607 255 2430; Fax: +1 607 255 6249; Email: jwr7@cornen.edu 
© The Author(s) 2014. Published by Oxford University Press. 

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ 
by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial 
re-use, please contact journals.permissions@oup.com 



5098 Nucleic Acids Research, 2014, Vol. 42, No. 8 



depends both on the strength of a^^ binding in the 
promoter and the nucleotide composition of the initial 
transcribed sequence (ITS) (3,14,15). 

A reasonable inference is that a^^-dependent paused 
complexes (Figure 1) have the essential features of initial 
transcribing complexes, including the scrunched DNA 
structure (16,17), with the central difference that the 
RNA of >16nt is not susceptible to abortive loss 
because it is stabilized by the interactions that also stabil- 
ize the elongation complex, i.e. enclosure of the RNA/ 
DNA hybrid and binding of RNA in its exit channel 
(18). The paused complex is anchored by a^^ interacting 
with the non-template strand of the secondary —10 
sequence (9) (Figure 1). However, the actual site of 
pausing, i.e. the extent of scrunching-based elongation 
after a^^ binds DNA, is determined by sequence 
elements that also likely determine the general features 
of elongation such as ubiquitous pausing (19-21). 

Modification of the a^^-dependent paused RNAP by 
protein, which binds to both a specific DNA binding 
site behind RNAP (22) and to elements of RNAP 
[including particularly the beta flap (23)], results in a 
stable complex of with RNAP (24) that changes the 
RNAP elongation properties. Binding of assists RNAP 



in escaping the pause, and the Q^-modified complex 
pauses less downstream, an essential element of the 
antitermination mechanism (25). Just as scrunching 
energy is proposed to break a^^ bonds with promoter 
DNA, we propose that Q'^ promotes further scrunching 
from the a^^-dependent pause that provides the energy to 
break <j^^ (and presumably Q'^) bonds with DNA. 

We have used a distinctive property of the a^^-pausing/ 
system to provide evidence for this extended 
scrunching model of Q'^ function, and also to reveal im- 
portant aspects of the sequence basis of early abortive 
transcript release. In addition to escaping into productive, 
antiterminated elongation, Q'^-modified elongation 
complexes downstream of the a^^-dependent pause site 
have a tendency to undergo arrest in a sequence-specific 
manner. We propose that this Q'^-dependent arrest is 
analogous to a step of abortive release of early transcripts: 
it depends on the anchoring activity of a^^ (in combin- 
ation with Q'^ in this case), and the RNAs are backtracked 
and sensitive to the action of GreB cleavage factor 
(Figure 1) (6), just as the formation and release of 
abortive initiation products are sensitive to GreB (26). 
The stability against dissociation of Q'^-modified 
complexes, which are mature elongation complexes. 
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Figure 1. A model for Q -stimulated escape from the A.pR' a -dependent pause. On engagement of the — 10-Hke element by a region 2, the 
transcription elongation complex undergoes a a^^-dependent pause. The paused elongation complex is a substrate for modification by Q^, which 
engages the elongation complex through contacts with the Q binding element and with RNAP itself. Assembly of into the elongation complex 
stimulates the resumption of transcription while a^^ is still bound to the — 10-hke element, resulting in an extended scrunch. The energy accumulated 
by DNA scrunching can be released in two ways: First, if the energy stored in the scrunch is insufficient to disrupt the interaction between a^^ and 
— 10-Hke element, RNAP backtracks into an arrested state, termed the QAC, and cleavage of the backtracked RNA must occur in order to resume 
transcription. Second, if the energy stored in the scrunch is sufficient to disrupt the interaction between a^^ and —10-hke element, the Q^-modified 
elongation complex disengages the pause site and resumes elongation. Presumably a^^ dissociates from RNAP following disengagement. 
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allows us to analyze their nature and sequence depend- 
ence. We discern important sequence elements that deter- 
mine the efficiency and site of arrest of Q'^-modified 
complexes, and we show explicitly that such sequences 
also determine the pattern of abortive transcript release 
from initial transcribing complexes. In particular, we 
identify a dinucleotide motif that determines a site and 
efficiency of major abortive RNA release. We further 
show that Q^-dependent arrest reflects backtracking to 
the site of the a^^-dependent pause where Q'^ initially 
engages; this result implies that the mechanism of 
Q'^-mediated release of RNAP from the a^^-dependent 
pause involves further scrunching from the site of the 
a^^-dependent pause. This result is consistent with the 
model that scrunching energy is stored in a stressed inter- 
mediate, which then functions in initiation to break the 
a^^-promoter bonds, but which is proposed to act here to 
break both a^^ and bonds to DNA and allow the 
modified complex to escape into elongation. 



MATERIALS AND METHODS 

Plasmids 



pM650 (27), p'QE-30 (28) and pET-28a- (29) have 
been described. pES3 is Gr^^-6xHis in pET-28b(+). 
pES7 is GreBD4iA, i?4^^-6xHis in pET-28b(+). pSAN2 (a 
gift from L. Hsu) contains pN25Anti. pVSlO (a gift from I. 
Artsimovitch) contains all four core RNAP subunits. All 
mutants were constructed using Quikchange site-directed 
mutagenesis. 



Proteins 



RNAP (30), GreB and GreBD4iA, e44a (31), a^^ (29) and 
NusA (32) were purified as described. was purified as 
described (17), except that it is stored in 10 mM potassium 
phosphate, pH 6.5, 50% glycerol, 200 mM potassium 
chloride, 1 mM ethylenediaminetetraacetic acid (EDTA) 
and 7mM tris(2-carboxyethyl)phosphine (T. Santangelo 
and J. Filter, unpublished data). 

In vitro transcription 

Reaction mixtures containing 2nM template and 20 nM 
RNAP (20 nM core reconstituted with lOOnM a^^) were 
incubated in transcription buffer (20 mM Tris-HCl, pH 
8.0, 0.1 mM EDTA, 1 mM dithiothreitol and 50 mM 
KCl), O.lmg/ml bovine serum albumin and 200 |iM 
ATP, OTP, CTP and 50|iM UTP (containing 0.5|iCi/|il 
a.32p.uxP) and 150nM NusA for 15min at 37°C to form 
open complex. Supplementary Figure S3B used a nucleo- 
tide mixture containing 200 |iM UTP, GTP, CTP and 
50|iM ATP (containing 1.0|iCi/|il y-^¥-UTP) and 
lOnM template. Single-round transcription reactions 
were initiated by addition of magnesium chloride to 
5mM and rifampicin to 10|ig/ml. Total reaction volume 
was 25|il. For abortive initiation assays (except that of 
Supplementary Figure S5B), open complex was formed 
as above, except that the NTP concentration was 
200 |iM ATP, UTP, GTP and 50 |iM CTP (or a dilution 
of this NTP mix) and contained 0.5|iCi/|il a-^¥-CTP). 







+ 


GreB 




+ 


D41A 
E44A 


+ 


1 D41A 
1 E44A 




I 



+25 



+16 



Time 
(min) 



0.25 1 
0.5 5 



Figure 2. Q^-dependent arrest at A.pR'. Single round in vitro transcrip- 
tion from A.pR' in the presence and absence of Q^, GreB and GreBD4iA, 
E44A- Readthrough (RT), terminated (T), QAC and paused transcripts 
(+16, +17) are shown. The QAC are designated by a bracket. +17 is 
included in the QAC because complexes trapped at +17 chase by 5 min 
in the presence of Q^, suggesting that they are distinct from the 
complexes trapped at +17 in the absence of Q^. Percentage readthrough 
was determined by normalizing the intensity of the RT and T bands for 
U content and dividing the normalized RT by the sum of the 
normalized RT and T. 



Figures 5D and 6B were repeated using three alternative 
NTP mixes (with ATP, UTP or GTP 50 |iM, and the re- 
maining NTPs at 200 |iM) to ensure that the NTP mix did 
not bias the abortive pattern. Supplementary Figure S5B 
was performed using 20 |iM UTP, GTP, CTP and 5 |iM 
ATP (containing 0.5 |iCi/|il oc-^^P-ATP). Multiround tran- 
scription was initiated by the addition of magnesium 
chloride to 5mM. When present, GreB was added to 
100 nM before formation of open complex. In experiments 
of Figure 2 and Supplementary Figures SI A and B con- 
taining GreBD4iA, e44A, both GreB and GreBD4iA, e44A 
were added to a concentration of 1 |iM in their respective 
reactions. When present, Q'^ was added to a concentration 
of 250 nM after open complex formation and incubated at 
37°C for 30 s before initiation. Reactions were stopped by 
adding 125 |il of stop solution (0.6M Tris, pH 8.0, 12 mM 
EDTA, 0.16mg/ml transfer RNA). 

Cleavage assays 

Biotinylated template was bound to Promega Streptavidin 
MagneSphere Paramagnetic Particles. Transcription was 
initiated in the presence of Q'^ and allowed to proceed for 
2 min to form the Q'^-dependent arrested complexes 
(QAC), at which point the reactions were washed three 
times with T buffer + 0.1 mg/ml bovine serum albumin to 
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remove nucleotides and magnesium chloride, halting the 
reaction. For nucleotide starvation cleavage, reactions 
were then resuspended in T buffer + lOOnM GreB, 
incubated at 37°C, and initiated by addition of magnesium 
chloride to 5 mM. To cleave backtracked RNAs and chase 
the 5^ cleavage products, reactions were resuspended in T 
buffer, lOOnM GreB and 200 |iM NTPs, incubated at 
37°C, and initiated by addition of magnesium chloride 
to 5mM and rifampicin to 10|ig/mL Reactions were 
stopped by adding 125 |il of stop solution. 

Heteroduplex templates 

Heteroduplex templates were constructed as described 
previously (33) and sequenced to confirm purity. 
Transcription was performed as described above. 

Purification, fractionation and analysis of transcription 
reactions 

Stopped transcription reactions were phenol extracted by 
addition of 150|il of phenol/chloroform/isoamyl alcohol 
(25:24:1), vortexing, centrifugation and collection of the 
aqueous phase. Ethanol precipitation of RNA was per- 
formed by adding 450 |il of 100% ethanol to each 
reaction, followed by storage at — 20°C overnight. 
Precipitated RNA was resuspended in transcription 
loading dye (Ix T buffer, 80% formamide, 0.05% 
bromophenol blue and xylene cyanol). Reactions were 
fractionated by electrophoresis using 12 or 1 5% denaturing 
polyacrylamide gels containing 6M urea. Reactive bases 
were detected by an Amersham Biosciences Typoohn 
9400 Variable Mode Imager. Quantitation was performed 
using ImageQuant. '%Q^-modified' refers to the per- 
centage of a subset of Q'^-modified complexes out of all 
Q'^-modified complexes (QAC + readthrough). '% Non- 
Abortive Transcripts' refers to the percentage of a particu- 
lar transcript length out of all non-abortive transcripts, 
thus excluding abortive products. 

RESULTS 

induces formation of a backtracked early transcription 
arrest 

Inclusion of Q'^ protein during in vitro transcription from 
A.pR' causes modification by Q'^ of 70% of the transcrip- 
tion complexes (Figure 2); of these, 52% appear as termin- 
ator readthrough owing to the antitermination property of 
Q'^, and the remaining 48% are trapped in a backtracked 
state — termed the QAC. The QAC include species of 
17-19 and 24-28 nt (Figures 1 and 2, bracket); they are 
eliminated by addition of the transcription cleavage factor 
GreB, resulting in all Q'^-modified complexes appearing as 
readthrough (Figures 1 and 2). Although the QAC appear 
largely stable over 5min (Figure 2), they disappear on 
longer incubation (Supplementary Figure SI A). We 
earlier surmised that the QAC are paused (16), implying 
an abihty to resume elongation, but it is more Hkely that 
their disappearance results from intrinsic transcript 
cleavage (or possibly trace contamination by GreA or 
GreB) that rescues them into productive elongation. 



As further evidence that the QAC are backtracked and 
rescued by transcript cleavage, we used a catalytic 
mutant of greB that changes both essential carboxylates 
(D41 and E44) to alanine (34), resulting in a mutationally 
altered protein that inhibits the active center-dependent 
intrinsic cleavage reaction (Supplementary Figure SIB). 
Inclusion of GreBD4iA, e44A slows the elongation rate sub- 
stantially, but also traps 90% of the Q^-modified 
complexes as QAC (Figure 2 and Supplementary Figure 
SI A). Interestingly, GreBD4iA, e44a also causes 97% of 
the total transcript to appear as abortive products 
(Supplementary Figure S2A and B). 

drives escape from the h^W a^^-dependent pause by 
stimulating DNA scrunching 

The identity of the first 20 nt following a transcription start 
site (the ITS) is a major determinant of the pattern and 
quantity of aborted transcripts at a promoter (15). Because 
abortive initiation occurs from a scrunched complex, the ITS 
presumably affects the stability of the scrunched complexes 
as transcription proceeds. Similarly, if Q'^-dependent arrest 
is analogous to abortive initiation, the sequence downstream 
of the a ^-dependent pause should modulate the length and 
distribution of the RNAs in the QAC. Furthermore, RNAP 
in the QAC should remain stably bound at the a^^-depend- 
ent pause, just as an initial transcribing complex would 
remain bound at the promoter following a failed attempt 
at promoter escape. 

To determine the role of sequences downstream of the 
a^^-dependent pause in Q'^-dependent arrest, we replaced 
positions +17 to +36 of ApR' with the ITS of promoter 
N25Anti to create a template termed ApR'17N25A 
(Figure 3A). N25Anti is a strongly abortive promoter 
known to form abortive transcripts up to 15nt in length 
(26,35), making it an ideal model for studying our analogy. 
If the sequence downstream of the a^^-dependent pause 
affects Q'^-dependent arrest in the same way that the ITS 
affects abortive initiation, then substitution of this down- 
stream sequence with an ITS that generates long abortive 
products should result in the formation of QAC that 
contain longer RNAs as a result of transcription further 
downstream before collapse into a backtracked state. 

Figure 3B depicts single-round transcription reactions 
from both ApR' and ApR'17N25A in the presence and 
absence of Q^. In the presence of Q'^, elongation 
complexes transcribing from ApR' undergo Q^-dependent 
arrest after transcribing to a cluster of positions focused at 
+25, whereas elongation complexes transcribing from 
ApR'17N25a undergo Q'^-dependent arrest after transcrip- 
tion to a cluster focused at +30 and +31. The 30-nt RNA 
present in these QAC is exactly 14nt longer than the a^^- 
dependent pause product and thus corresponds in 
endpoint to a major 14-nt long abortive product of the 
N25Anti promoter. This result supports the interpretation 
that the N25Anti ITS alters Q'^-dependent arrest in a way 
that reflects its abortive properties. 

Complexes that have undergone Q'^-dependent arrest 
are backtracked, and thus are sensitive to the cleavage 
factor GreB (Figure 2). As expected, the ApR'17N25A 
QAC are GreB-sensitive, indicating that they are 
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Figure 3. Sequence dependence of the QAC. (A) Sequence comparison 
of A.pR' and A.pR'17N25A from positions +1 to +36. (B) Single round 
in vitro transcription from A.pR' and A.pR'17N25A in the presence and 
absence of Q^. 



backtracked (Supplementary Figure S3). It is noteworthy 
that in addition to the effects on Q'^-dependent arrest, 
complexes transcribing ApR'17N25A undergo a pause at 
+30 in the absence of that is insensitive to GreB. The 
appearance of such a pause suggests that the N25Anti ITS 
has an innate preference for pausing at this position, 
which may influence its abortive properties and its 
behavior in the fusion ApR'17N25A. 

DNA scrunching requires that RNAP remain bound at 
a defined position while synthesizing RNA without 



translocation. In the context of initial transcription, 
failure to escape the promoter causes abortive release of 
the nascent transcript through a process thought to be 
similar to backtracking (15,26). In our model, the 
increase in DNA scrunching that results from modifica- 
tion of the a^^-dependent paused complex by Q'^ occurs 
while a^^ is bound to the pause-inducing sequence; thus, 
collapse during further scrunching should result in a Q'^- 
modified elongation complex still bound through a^^ 
interaction at the pause-inducing sequence, and arrested 
in a backtracked conformation. To determine the position 
to which the QAC of ApR' and ApR'17N25A backtrack, 
we first performed GreB-mediated cleavage in the absence 
of NTPs (Figure 4A). We generated QAC on template 
bound to magnetic beads, washed to remove NTPs and 
induced cleavage by incubating with GreB. In addition to 
an experiment in which RNAs were internally labeled, we 
performed a second nucleotide starvation cleavage experi- 
ment in which RNAs were end-labeled to allow for clear 
visualization of the 5^ cleavage products (Supplementary 
Figure S3B). Strikingly, both ApR' and ApR'17N25A QAC 
backtrack to a range of positions from +11 to +16, which 
is consistent with the previously established range of 
positions to which a^^-dependent paused complexes back- 
track on ApR' (16). Thus, the observation that the ApR' 
and ApR'17N25A QAC backtrack to the same positions 
indicates that they remain bound at the a^^-dependent 
pause regardless of the position to which they have 
transcribed. 

To confirm our interpretation of the nucleotide 
starvation cleavage results, we examined the y cleavage 
products that are generated when the QAC are 
treated with GreB: If ApR' and ApR'17N25A QAC 
backtrack to positions from +11 to +16, then treatment 
with GreB should yield y cleavage products of 9-14 and 
14-20 nt, respectively. We generated QAC on template 
bound to magnetic beads, washed to remove NTPs 
and incubated in the presence of GreB and unlabeled 
NTPs to stimulate cleavage and chase the 5^ cleav- 
age products into longer RNAs. As expected, cleavage 
of the QAC on ApR' yields RNA of -9-14 nt in length, 
and cleavage of the QAC on ApR'17N25A yields RNA of 
—14— 20nt in length (Figure 4B). It is important to note 
that the long y cleavage products are Q'^-dependent, 
indicating that they originate from the QAC that 
contain long RNAs, thus, confirming the interpretation 
that the QAC backtrack to a cluster of positions from 
+ 11 to +16 regardless of the position to which they 
transcribe. 

Together, the Q'^-dependent nature of these complexes 
and the position to which they backtrack indicate that 
they result from Q^-modified elongation complexes that 
attempt and fail to escape from the a^^-dependent 
pause. The observation that the ApR' and ApR'17N25A 
QAC originate from the a^^-dependent pause, yet tran- 
scribe to different positions before backtracking, 
estabhshes that the sequence composition immediately fol- 
lowing the a^^-dependent pause is a critical determinant of 
Q'^-dependent arrest and is consistent with a mechanism of 
DNA scrunching. 
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Figure 4. QAC backtrack to +15 regardless of the positions to which 
they have transcribed. (A) Cleavage of the A.pR' and A.pR'17N25A 
QAC. After formation of the QAC, NTPs were removed and GreB 
was added. Samples were incubated at 37° C and removed at the 
indicated times. 5' and 3' cleavage products were differentiated by the 
assay shown in (B) and Supplementary Figure S3B. (B) y cleavage 
products of the A.pR' and A.pR'17N25A QAC. As in (A), except tran- 
scription was resumed in the presence of GreB and NTPs. The exact 
sizes of y cleavage products are uncertain within about a nucleotide 
because they have 5'-monophosphate rather than 5'-triphosphate ends 
Hke the other transcripts. However, the approximate sizes of the A.pR' 
and A.pR'17N25A 3' cleavage products are both distinct and within the 
expected ranges. 



A systematic mutant scan reveals a sequence element 
responsible for the long abortive products of the N25Anti 
promoter and the extended Q-dependent arrest 

The ability of the ITS to change the position to which 
Q'^-modified elongation complexes transcribe before 



backtracking and arrest indicates that a sequence 
element within the N25Anti ITS is responsible for 
determining the extent of scrunching that occurs during 
escape from the a^^-dependent pause and the N25Anti 
promoter. To reveal any such elements, we performed a 
systematic mutant scan that covered the positions +17 to 
+31 of the ApR'17N25A template (Figure 5A). The scan 
consisted of overlapping 3 bp substitutions in which we 
made A/C and G/T transversions. We then performed 
single-round in vitro transcription to assay for altered 
Q'^-dependent arrest. Strikingly, two overlapping 
mutants showed a decrease in the amount of complexes 
arrested at +30 and the appearance of complexes at +24, 
1 nt shorter than the wild-type position of +25 
(Figure 5B). These mutants shared G to T mutations in 
positions +21 and +22 (positions +5 and +6 of the N25Anti 
ITS). To establish that the +21 and +22 nt are entirely 
responsible for the restoration of a shorter Q^-dependent 
arrest, we constructed ApR'17N25a-tt5 ^ A.pR'17N25A 
mutant containing the +21 and +22 G to T mutations. 
As expected, in the presence of Q'^, the majority of QAC 
clustered around position +24 rather than +30 
(Figure 5C). 

We then performed the mutant scan described above on 
A.pR' from positions +16 to +25 to determine whether a 
similar sequence element existed within the wild-type ApR' 
sequence (Supplementary Figure S4A). The introduction 
of a G-rich region at positions near +21 and +22 shifted 
the primary position of arrest from +25 to +27, with some 
complexes appearing at +31 and +32 (Supplementary 
Figure S4B). Thus, introducing the GG sequence 
element from N25Anti is sufficient to make the arrest 
pattern of ApR' similar to that of ApR'17N25a; in fact, 
no other portion of the N25Anti sequence has significant 
effect on the arrest, although the exact pattern of arrest 
sites varies slightly (Supplementary Figure S4C and D). 

If there is a relationship between the effect of the 
N25Anti ITS on Q'^-dependent arrest and the abortive 
properties of the N25Anti promoter, mutations that 
disrupt the extended Q'^-dependent arrest of 
ApR'17N25A should affect the long abortive transcripts 
of the N25Anti promoter in a similar manner. To test 
this, we introduced G to T mutations at positions +5 
and +6 of the N25Anti promoter. Multiround in vitro tran- 
scription revealed that the N25a-tt mutant primarily ac- 
cumulates abortive products of 6, 7, and 8 nt and displays 
a dramatic reduction of 13 and 14nt aborted transcripts 
(Figure 5D). The effect of the +5, +6 G to T mutations is 
enhanced at low nucleotide concentrations, but persists up 
to the standard concentration of our transcription reac- 
tions. This shift is analogous to the shift observed in the 
QAC of the ApR'17N25a-tt mutant, further indicating 
that the abortive properties of the N25Anti ITS are 
reflected in its effects on Q^-dependent arrest and thus 
supporting the model that scrunching is the mechanism 
by which Q'^-modified elongation complexes escape the 
a^^-dependent pause. In fact, a scan by overlapping 3-nt 
substitutions in the N25Anti ITS confirms that only the +5 
and +6 doublet is important to the abortive pattern 
(Supplementary Figure S5A and B). [We ignore the tran- 
scripts of mutants 1, 2 and 3, which are obscured by 
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stuttering due to the T run in the beginning of the 
transcript (36)]. 

Strand specificity of the N25Anti + 5/ + 6 Mutation 

The template and non-template strands of a DNA duplex 
have distinct functions and contacts within a transcribing 
complex. Such asymmetry means that the effect of a 
mutation can be assigned to the template, non-template 
or both strands of DNA. It is possible to differentiate the 
activity of each strand by analyzing the behavior of 
heteroduplex templates. The availability of a 2-bp 
mutation that generates arrest (in the context of 
A.pR'17N25a) and abortive (in the context of N25Anti) 
patterns that are easily distinguishable from those of the 
original N25Anti sequence allowed us to create 
heteroduplex templates and evaluate the strandedness of 
the N25Anti +5T, +6T mutant effects. 

Single-round in vitro transcription was performed on 
^pR'17N25a, ApR'17N25a-tt and the corresponding 
heteroduplexes. Figure 6 A shows that the template 
strand dictates the position of Q'^-dependent arrest: 
when A.pR'17N25A is in the template strand, Q'^-depend- 
ent arrest is identical to that of the ApR'17N25A 
homoduplex, whereas when A.pR'17N25a-tt is in the 
template strand, Q'^-dependent arrest is identical to that 
of the A.pR'17N25a-tt homoduplex. 



We then performed a heteroduplex analysis of N25Anti 
and N25a-tt to determine the strandedness of the +5T, 
+6T mutations in the context of abortive initiation 
(Figure 6B). Neither heteroduplex produces an abortive 
profile like that of the parental homoduplexes. 
Moreover, heteroduplexes containing a mutant non- 
template strand form a unique abortive profile, indicating 
that the ITS non-template strand is also a determinant of 
abortive transcription. The contribution of the non- 
template strand in the context of abortive initiation and 
the absence of such an effect in the context of Q'^-depend- 
ent arrest suggests that while the underlying mechanism is 
shared, differences between a promoter-bound initial 
transcribing complex and a Q'^-modified a^^-dependent 
paused complex affect the ways in which ITS composition 
modulates escape. 

Strengthening the interaction between and the 
— 10-like element increases the quantity and alters the 
distribution of both Q^-dependent arrest and 
a^^-dependent pausing 

In addition to the ITS, the strength of the interaction 
between RNAP and the promoter elements plays a role 
in the formation of abortive products, with stronger pro- 
moters tending to yield more aborted transcripts (14). If 
the mechanism of Q'^-dependent arrest is in fact analogous 
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to abortive initiation, strengthening the interaction 
between a^^ and the — 1 0-Hke element should increase the 
likelihood of Q'^-modified transcription elongation 
complexes undergoing Q^-dependent arrest. To test this, 
we created a A.pR' mutant, A.pR'— lOLikccons? that 



contains the —10 element consensus sequence 'TATAAT' 
in place of the wild-type 'AACGAT' in positions +1 to +6 
(Figure 7A). In ApR', substitution of a consensus — 10-like 
element in place of the wild-type sequence creates an 



especially strong interaction with a^^ due to the 



Nucleic Acids Research, 2014, Vol. 42, No. 8 5105 



presence of a GGG 'discriminator-like' sequence (37). 
Single-round in vitro transcription reveals that a consen- 
sus — 10-like element increases the quantity and alters the 
distribution of both Q^-dependent arrest and a^^ pausing 
(in addition to shifting the ladder down 1 nt because ini- 
tiation occurs at the A of position 2 relative to wild-type) 
(Figure 7B and Supplementary Figure S6A). 

The effect of the ApR' — lOLikecons on Q'^-modified 
elongation complexes is to broaden the range of species 
captured during Q'^-dependent arrest from 17-19 to 24-28 
so that QAC appear at all positions from 17 to 28 
(Figure 7B and Supplementary Figure S6A). We propose 
that the expanded range of QAC on ApR' — lOLikccons is 
the result of the strengthened interaction between a^^ and 
the — 10-Hke element trapping complexes that would 
normally bypass Q'^-dependent arrest and causing them 
to collapse into a backtracked state. We constructed a 
— 10-like consensus mutant of A.pR'17N25A, termed 
ApR'17N25a — 10Likecons5 to t^st whether strengthening 
the interaction between a^^ and the —10-like element 
would have similar effects on mutant QAC (Figure 7C). 
As expected, A.pR'17N25A — lOLikccons expands the QAC 
from 17-19 and 29-31 (on ApR'17N25a) to 17-34 
(Figure 7D). It is striking that the pattern of Q-dependent 
arrest that ApR'17N25A — lOLikccons displays from +17 
to +30 (Figure 7D) is similar to the abortive release 
pattern of N25Anti from +1 to +14 (Figure 5D), which is 
the same sequence of DNA; note particularly the peak of 
arrest at +23 of ApR'17N25A— lOLikccons, which corres- 
ponds to +7 in N25Anti- 

In the absence of Q'^, elongation complexes undergo 
a^^-dependent pausing on both ApR' and the 
ApR' — lOLikecons mutant, as shown by bands at +16 
and +17. ApR' — lOLikccons captures complexes with 
greater efficiency, as would be expected from a mutation 
that increases affinity for a^^ (Supplementary Figure S6B 
and C). In addition to an increase in pausing, 
ApR' — lOLikecons captures several-fold more complexes 
that have transcribed to +18 and +19 as well. This effect 
appears to be similar, albeit less dramatic, to the broaden- 
ing of Q'^-dependent pausing in the same mutant. 



DISCUSSION 

We have shown that two determinants of the efficiency 
and pattern of promoter escape and abortive initiation, 
namely, the nucleotide composition of the ITS and the 
strength of the interaction between a^^ and DNA, also 
underlie Q'^-dependent escape from the a^^-dependent 
pause at ApR' and the formation of the QAC. We 
conclude that both Q'^-dependent escape from the a^^-de- 
pendent pause and escape of initial transcribing complexes 
from the promoter share a common mechanism of 
advance: DNA scrunching. Thus, we have shown that 
DNA scrunching occurs in a context outside of 
promoter escape and, furthermore, can be modulated by 
a ^ra^^-acting factor, in this case the Q'^-antiterminator. 
In addition, we identify a sequence element that is an 
important determinant of the pattern of abortive release 
of RNAs. 



Mechanism of Q^-dependent release of paused complexes 
into elongation 

At ApR', substitution of the N25Anti ITS in place of the 
WT sequence from +17 to +36 changes the position to 
which Q'^-modified complexes transcribe before undergo- 
ing Q'^-dependent arrest, from a cluster focused at +25 to 
a cluster focused at +30 and +31. However, on both tem- 
plates, RNAP remains bound to the —10-like element 
because the active center is found at +15 in both the 
WT and mutant QAC. Thus, the composition of the 
20 bp immediately downstream of the a^^-dependent 
pause site dictates the position of the y end of the RNA 
bound within the QAC, but does not affect the position to 
which RNAP backtracks. These characteristics are con- 
sistent with a mechanism of scrunching, in which RNA 
is synthesized without movement of RNAP along DNA, 
but instead DNA is melted and drawn into the enzyme. 

The role of the ITS in abortive synthesis is well defined, 
and it is clear that the base composition of positions +1 to 
+20 of transcribed DNA dictates the positions at which 
abortive transcripts are released. Similarly, sequences 
downstream of the a^^-dependent pause determine where 
Q'^-dependent arrest occurs; we suggest that in both 
processes, DNA sequence sets the transcription pattern 
by determining the stability of scrunched complexes. The 
relationship between the effects of the N25Anti ITS on 
abortive synthesis and Q'^-dependent arrest supports the 
interpretation that Q'^ promotes escape of RNAP from the 
a^^-dependent pause as an advancing scrunched complex, 
accumulating energy of scrunching that is used to break 
a^^ or a^^ and Q'^ interactions with DNA and release 
modified RNAP into downstream elongation. 

Increasing the strength of the interaction between 
RNAP and the -10-like element of the ApR' a^^-depend- 
ent pause, by substituting the wild-type sequence AACGA 
T with the consensus — 10-element TATAAT, expands the 
region in which elongation complexes are captured, result- 
ing in Q'^-dependent arrest from +17 to +28 instead of the 
wild-type positions of +24 to +28. Similarly, when a con- 
sensus — 10-like element is introduced into ApR'17N25A, 
Q'^-dependent arrest occurs at all positions between +17 
and +34, as opposed to arrest primarily at +17 to +19 and 
a cluster around +30 in the presence of the wild-type ApR' 
— 10-Hke element. The changed pattern of arrest between 
wild-type and consensus —10-like elements gives insight 
into the natural process of Q'^-dependent escape from 
the a^^-dependent pause. We propose the following 
interpretation. 

First, we note that the consensus —10-Hke element in- 
creases Q'^-dependent arrest relative to the wild-type —10- 
like element in regions near the +16 pause, namely, +17, 
+ 18 and +19, and also invokes novel arrest sites in the 
region +20 to +23. What is the origin of this enhanced 
arrest? It must represent complexes that with the wild-type 
pause-inducing sequence would either continue scrunching 
or break the a ^ and Q^ bonds and continue into product- 
ive Q'^-modified elongation. However, it is implausible 
that the consensus —10-Hke element would disfavor 
continuing the scrunch because it should only increase 
the strength of the a^^-DNA bond that must be 
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maintained during the scrunch. Thus, it follows that the 
increase in arrest due to the consensus element must reflect 
complexes that, when bound to the wild-type- 10-Hke 
element, have sufficient scrunching energy to release a^^ 
and Q'^ bonds with DNA and proceed into productive Q^- 
modified elongation. In the presence of a consensus —10- 
hke element, the scrunching energy required for pause 
escape is increased so that these complexes no longer 
have sufficient energy to break the interaction between 
a^^ and the DNA, and instead, backtrack and arrest. 
Thus, the sites of enhanced arrest with the consensus 
— 10-Hke element mark natural sites where a^^ dissociates 
from the —10-Hke element and where productive Q'^- 
modified elongation begins. 

On exceeding some limit of stability, the scrunched 
complex becomes prone to collapse and backtracks with 
high probabiHty, whether the wild-type or consensus —10- 
like element is present, resulting in Q'^-dependent arrest at 
RNA lengths 24-28. As evidenced above, this limit is sen- 
sitive to the nature of the 20 bp of sequence immediately 
foUowing the site of the a^^-dependent pause; for example, 
introduction of the N25Anti ITS changes the site of arrest 
to lengths around +30. Thus, these results reveal the 
natural process of Q'^-dependent escape into elongation. 

Nature of abortive initiation 

In addition to the similarities between an initial- 
transcribing complex and a a^^-dependent paused 
complex, there are also discrete differences, most notably 
the contacts between a^^ and the core, the presence of a 
mature RNA, and, when present, the contacts with Q^. 
The availability of a second system besides promoter ini- 
tiation in which RNAP must break an interaction between 
a^^ and DNA to proceed forward provides a structurally 
distinct context in which to examine the elements that 
contribute to promoter escape. Such an analysis can 
clarify the role of these elements and may lead to a 
more general understanding of abortive synthesis. 

The existence of abortive initiation has been attributed 
to two distinct phenomena: scrunching and displacement 
of the a^"^ loop by the emerging RNA product (10-13). A 
further barrier that could contribute to abortive initiation 
is the displacement of a"^ that occurs as the RNA reaches 
14-15 nt in length (38). All of these processes would store 
energy as the initial transcript grows, and the failure of 
any to continue could lead to abortive loss of RNA. 
However, neither a^"^ loop displacement nor a"^ displace- 
ment can be involved in QAC formation because the RNA 
is too long, leaving scrunching as the only plausible 
process to account for QAC formation; therefore, our 
demonstration that the pattern of abortive release is re- 
constructed in the pattern of QAC formation, specifically 
in the case of ApR'17n25a — lOLikccons, suggests that 
scrunching is the dominant process that forms the 
pattern of abortive synthesis at this promoter. 

Our investigation of the QAC has provided a pathway 
to understanding a sequence basis of the pattern of 
abortive transcript release. Previous work has shown, 
first, that stronger promoter consensus elements 
(e.g. —35, —10 and discriminator elements) increase the 



length and yield of abortive products (14), and, second, 
that the ITS determines both the pattern and level of 
abortive release (15). It was shown previously that there 
is a correlation between purine-richness of the ITS and 
abortive tendency (15), but no specific sequence elements 
have been identified. We have used a systematic scan 
across the N25Anti ITS to find elements that affect QAC 
formation, and then to make inferences about ITS 
function in abortive transcript release. 

A pair of G to T mutations at positions +5 and +6 of 
the N25Anti ITS (+21 and +22 in ApR'17N25a) shifts the 
position to which RNAP transcribes before Q^-dependent 
arrest from a cluster at +30 to a cluster at +24, indicating 
that the identity of these 2 nt is essential to the properties 
of the N25Anti ITS. The critical nature of these positions is 
reflected by their function in abortive initiation in the 
N25Anti promoter: introduction of the +5, +6 G to T mu- 
tations into N25Anti alters abortive synthesis in a manner 
equivalent to the effect of the corresponding changes in 
ApR'17N25A on Q'^-dependent arrest. The bulk of N25Anti 
abortive products are 7, 13 and 14nt in length, with minor 
abortive products occurring at aH positions between +2 
and +15. The mutant N25a-tt promoter displays 
strongly decreased abortive transcripts at +13 and +14, 
and instead accumulates many more abortive transcripts 
of 6, 7 and 8 nt in length. A reasonable conjecture is that 
the uridine-richness at +5, +6 near the end of the 6, 7 and 
8 transcripts destabilizes the DNA/RNA hybrid and 
promotes dissociation. The decrease in 13 and 14 nt 
aborted RNAs in N25a-tt is likely the result of increased 
abortive release at +6, +7 and +8 reducing the number of 
transcripts that proceed to +13 and +14; the ratio of tran- 
scripts at +12 to +15 to fuH-length transcripts, i.e. the 
probability of abortive stalling at +12 to +15 is constant 
across the overlapped set of 3-nt substitutions that dras- 
ticaHy changes the ratio of +12 to +15 to smaller abortive 
transcripts. The abortive pattern is sensitive to NTP con- 
centration: low NTP concentrations favor the accumula- 
tion short abortive products, whereas high NTP 
concentrations facilitate the formation of longer abortive 
products and partly restore the 13 and 14nt abortive tran- 
scripts in N25a-tt- This trend impHcates the rate of tran- 
scription as a key determinant of abortive synthesis and 
suggests that other influences on transcription rate, e.g 
interaction with DNA-bound factors, influence promoter 
escape and abortive synthesis. 

It is not surprising that the differences between these 
complexes would produce subtle variations in the way 
DNA scrunching is modulated by the ITS. This is 
evident in a heteroduplex analysis of the N25Anti and 
N25a-tt ITSs in both contexts. Transcription on 
ApR'17N25A and ApR'17N25a-tt heteroduplexes reveals 
that the effect of ITS composition on formation of the 
QAC is solely dependent on the template strand. 
However, in the context of initial transcription at 
N25Anti and N25a-tt5 neither heteroduplex produces an 
abortive pattern identical to either of the homoduplexes 
from which they are derived. Thus, the effect of ITS com- 
position on the QAC is clearly attributable to the template 
strand, whereas both the template and non-template 
strands modulate abortive initiation. It is plausible that 
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this difference is a reflection of the numerous structural 
distinctions between a a^^-dependent paused complex and 
an initial-transcribing complex that affect the interactions 
between RNAP and the DNA template. 
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